Prosody Based Speech Segmentation

Author

  • Toshie Hatano
Abstract

Two experiments were conducted to test whether prosody can serve as a unit of phonological segmentation. In Experiment 1, 24 participants were asked to rate meaningless speech imitating 40 meaningful sound sequences produced by one male speaker; 94.7% of the selected combinations conformed to Japanese accent rules. In Experiment 2, 19 participants were asked to rate meaningless speech imitating 76 meaningful sound sequences produced by a different male speaker; 92.8% of the selected combinations conformed to Japanese accent rules. These results suggest that native speakers of Japanese can also recognize segment boundaries based on prosody.


Similar resources

Prosody Modeling for Automatic Speech Recognition and Understanding

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automati...


Prosody Modelling for Syllable-based Speech Synthesis

The paper describes the prosody model used in the syllable-based speech synthesizer DEMOSTHENES. It focuses on the segmental structure, especially on the segmentation into rhythm units (prosodic phrases). Relations between prosodic segments and sentence constituents are also discussed.


Combining Words and Speech Prosody for Automatic Topic Segmentation

We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmentation of speech into topic units. The approach combines hidden Markov models, statistical language models, and prosody-based decision trees. Lexical information is obtained from a speech recognizer, and prosodic features are extracted automatically from speech waveforms. We evaluate our approach o...


Fully automatic segmentation for prosodic speech corpora

While automatic methods for phonetic segmentation of speech can help with rapid annotation of corpora, most methods rely either on manually segmented data to initially train the process or manual post-processing. This is very time-consuming and slows down porting of speech systems to new languages. In the context of prosody corpora for text-to-speech (TTS) systems, we investigated methods for f...


Auditory evoked potentials reveal early perceptual effects of distal prosody on speech segmentation

Mara Breen, Laura C. Dilley, J. Devin McAuley & Lisa D. Sanders (2014) Auditory evoked potentials reveal early perceptual effects of distal prosody on speech segmentation, Language, Cognition and Neuroscience, 2...




Publication date: 2006